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Abstract. A More Sums Than Differences (MSTD) set is a set of integers A c {0, . . . , n — 1} 
whose sumset A + A is larger than its difference set A — A. While it is known that as n — > oo 
a positive percentage of subsets of {0, . . . , n — 1} are MSTD sets, the methods to prove this are 



(N 



probabilistic and do not yield nice, explicit constructions. Recently Miller, Orosz and Scheinerman 
HMOS II gave explicit constructions of a large family of MSTD sets; though their density is less than a 
positive percentage, their family's density among subsets of {0, . . . , n — 1} is at least C/n 4 for some 
C > 0, significantly larger than the previous constructions, which were on the order of 1/2™/ 2 . We 
generalize their method and explicitly construct a large family of sets A with |A + A + yl + A| > 
\(A + A) — (A + A)\. The additional sums and differences allow us greater freedom than in OMOSL 
and we find that for any e > the density of such sets is at least C/n e . In the course of constructing 
such sets we find that for any integer k there is an A such that lA + A + A + ylj — | A + yl — A — A\ = k, 
^ c*| and show that the minimum span of such a set is 30. 
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1. Introduction 

Many problems in number theory reduce to understanding the behavior of sums and differences 
of a set with itself, where for a set A the sumset is 

A + A = {x + y: x,y E A} (1.1) 

and the difference set is 

A - A = {x-y.x,yeA}; (1.2) 

if A is finite we denote the number of elements of A is denoted by Examples include the 
Goldbach problem (if V is the set of all primes, then V+V contains all the even numbers), Waring's 
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problem, the Twin Prime Conjecture (there are infinitely many representations of 2 in V — V), and 
Fermat's Last Theorem. In studying these additive problems, it is natural to compare \A + A\ and 
\A — A\. If the sumset is larger, we say A is sum-dominant, or a More Sums Than Differences 
(MSTD) set. While such sets were known to exist (see llHel IMal iNaTl INa2l lNa3l iRuTl lRu2l lRu3ln 
it was thought that they were rare. Specifically, it was believed that as n — > oo the percentage 
of subsets of {0, . . . , n — 1} that were sum-dominant tends to zero. Martin and O'Bryant HMOU 
recently disproved this, showing that a positive percentage of sets are sum-dominant. They showed 
the percentage is at least 2 ■ 1CT 7 , which was improved by Zhao HZh21 to at least 4.28 • 10~ 4 (Monte 
Carlo simulations suggest that approximately 4.5 • 10~ 4 percent are sum-dominant). See HILMZ1I 
for a survey of the field, where these and other results (such as those in HHM1[[HM2H . which deal 
with varying the probability measure on {0, . . . , n — 1}) are given. 

It is natural to ask whether or not there is an explicit construction of large families of MSTD sets. 
Unfortunately, the proofs in HMQ[ IZh21 are probabilistic, and do not lend themselves to a clean 
enumeration of such sets. Miller, Orosz and Scheinerman HMOSI gave an explicit construction 
of MSTD sets (shortly thereafter, Zhao UZhll gave a new method as well). Previously the largest 
explicit families had, in the limit, at least f{n)/2 n / 2 of the 2 n subsets of {0, . . . , n — 1} being sum- 
dominant (with / a nice polynomial). The construction in HMOSM gives an explicit family of size 
at least C 4 /n 4 , which was improved to C\jn in UZhlH . The purpose of this paper is to extend the 
method in HMOSI to generalized MSTD sets. While our families will not be a positive percentage, 
we see in Theorem [Tj] that we can preserve the simplicity of the construction but improve the result 
to missing by an arbitrarily small power. 

Before explaining Miller, Orosz and Scheinerman's construction, we first set some notation. 

• We let [a, b] denote all integers from a to b; thus [a, b] = {n E 7h : a < n < b}. 

• By k A we mean A added to itself k — 1 times: 

kA = A + --- + A. (1.3) 
v ' 

k times 

• We say a set of integers A has the property P n (or is a P n -set) if both its sumset and its 
difference set contain all but the first and last ^possible elements (and of course it may or 
may not contain some of these fringe elements) □ Explicitly, let a = min A and b = max A. 
Then A is a P n -set if 

[2a + n, 2b - n] C A + A (1.4) 

and 

[-(b-a) + n, (b-a)-n] C A - A. (1.5) 



Essentially, their method is as follows (see HMOSI for the full details). Let A be an MSTD set, 
and write A as L U R, where L is the left fringe and R the right fringe; for convenience, we assume 
\L\ = \R\ = n = \A\/2 and 1, 2n e A. Let O k = [1, k] = {1, . . . , k}, and for any M of length m 



It is not hard to show that for fixed < a < 1 a random set drawn from [1, n] in the uniform model is a Py an \ -set 
with probability approaching 1 as n —> oo. 
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A(M) = LU (n + O k ) U (n + k + M) U (n + k + m + O k ) U (n + 2k + m + R), (1.6) 

where a + S is the translate of S by a. If A is a P„, MSTD set, then A(M) is an MSTD set, 
so long as M contains one out of every k consecutive elements. The reason this is true is due to 
the fact that we have two intervals containing k consecutive elements, and these guarantee that all 
possible sums are realized as M never misses k consecutive elements. This controls the middle of 
A(M) + A(M); the fringes are controlled by L and the translate of R. One way to ensure M never 
misses k consecutive elements is to divide M into m/(k/2) consecutive blocks of size k/2 (assume 
||m), and note that each block may be any non-empty subset of a translate of [1, k/2]. The number 
of valid choices for all the blocks is 



the factor hitting 2 m measures how much we lose from our condition. There is also a loss from 
having two translates of Ok', we could have had 2 2k possible sets here, but instead have a fixed 
choice. Letting r = 2n + 2k + m and optimizing m and k, Miller, Orosz and Scheinerman show 
their family has density at least C/r 4 for some C > 0. 

An essential ingredient in HMOSM is the existence of one P n , MSTD set A. While it is not hard 
to find such a set by brute force enumeration, this becomes tricker for the generalized problems 
we now consider. Instead of looking at | A + A\ versus | A — A\, one could study | A + A + A\ and 
or and While the methods of IMQSJ generalize to these 

(and additional) cases, the increased number of additions and subtractions provide opportunities 
that were not present in A + A and A — A, and significantly larger families can be explicitly 
constructed once an initial set is found. For definiteness in this paper we mostly study sets with 
|A + A + A + A| > \A + A — A — A\, and we give an example where this holds. For general 
comparisons, Iyer, Lazarev, Miller and Zhang recently proved existence and positive percentage 
(see HILMZ 1 [ IILMZ2H for the construction). Our main result is the following. 

Theorem 1.1. For all e > 0, there is a constant C e > such that as r goes to infinity, the percentage 
of subsets A of[l, r] with \2A + 2A\ > \2A - 2A\ is at least CJr e . 

Remark 1.2. It is worth noting that Theorem ll . l\ gives us a higher percentage family of generalized 
MSTD sets (with \2A + 2A\ > \2A-2A\) than MSTD sets. Our methods generalize to \AA + AA\ > 
\4A — 4A\ (among other comparisons). 

In the course of proving Theorem ll.il our tools immediately yield 

Theorem 1.3. Given i£Z there exists an S x with \2S X + 2S X \ — — 2S X \ = x. 

In other words, we can construct these generalized MSTD sets such that we have arbitrarily more 
sums than differences. 

In §[2] we go through (in full detail) the calculation needed to generalize HMOSB . and obtain a 
lower bound for the probability of C/r 4//3 . We improve this to C /r e for any e > in §[3l and then 
end in Ml by showing we can find sets such that the size of the generalized sumset is any desired 
number greater than (or less than) the generalized difference set. Not surprisingly, as the bounds for 
the density of these generalized MSTD sets improve, our constructions become more complicated; 
this is why we provide full details and a description of the method for the weaker results. 




(1.7) 
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2. Constructing many A with \2A + 2A\ > \2A — 2A\, I 

In this section we generalize the construction in HMOS1 : we greatly improve the percentage in 
the next section. Here we prove 

Theorem 2.1. There is a constant C > such that as r goes to infinity, the percentage of subsets 
of [1, r] with \2A + 2A\ > \2A - 2A\ is at least C/r A/3 . 

We first describe our search for one set with the desired properties (as our approach may be of use 
in finding sets needed for other problems), then discuss some lemmas needed to generalize Miller, 
Orosz and Scheinerman's construction. 

We started by searching for a single set with | A + A + A + A | > | A + A — A — A\ ; from now on we 
use the notation AA to denote A + A + A + A and 2 A - 2A to denote A + A - A - A. We generated 
random subsets of [1 , 40] , including each number with probability 1/4, and checked if the generated 
sets had our desired property. We quickly found A = {6, 7, 9, 10, 13, 32, 35, 36, 38, 39, 40}, which 
has \ A + A + A + A\ = 136 and \A + A — A — A\ = 135. 

In order to construct an infinite family from one set A using the techniques of HMOSL A must 
satisfy two properties: 

• The set A must be a subset of [1, 2n] containing 1 and 2n. 

• The set A must be a P A set; meaning that AA and 2A — 2A contain at least all but the first 
and last n possible elements. 

While we can subtract 5 from each element in our set, to have it start at 1 without affecting the 
number of sums and differences, it then ranges from 1 to 35 and 35 is not even. Though we could 
restructure their construction to avoid needing the first condition, our set does not meet the second 
condition either. We then looked for further ways to modify our set, hoping to find a set that had 
\AA\ > \2A — 2A\ and meet their second condition. By taking our set and adding it to {0, 49} (that 
is repeating each element shifted by 49£|, we found our desired set. With n = 42, 

A = {1,2,4,5,8,27,30,31,33,34,35,50,51,53,54,57,76,79,80,82,83,84} C [l,2n], (2.1) 

1, 2n e A, and 

AA = [4, 336]\{27} D [n + 4,7n], 2A-2A = [—166, 166]\{141, —141} D [—3n + 1, 3n]. 

(2.2) 

This set thus meets all of the required properties to use a modified version of Miller et. al's con- 
struction of an infinite family of sets with \AA\ > \2A — 2A\. To do so, we first need to prove two 
lemmas, similar to their Lemma 2. 1 and Lemma 2.2. 

Lemma 2.2. Let A = LURbe a P A set where L C [l,n}andR C [n+l,2n]. Form A' = LllMUR' 
where M C [n + 1, n + m] and R' = R + m. If A' is a P A set then \AA'\ - \AA\ = \2A' - 2A'\ - 
\2A-2A\ = Am and thus if\AA\ > \2A - 2A\ then \AA'\ > \2A'-2A'\. 

The utility of this lemma is that if A were also a generalized MSTD set (with \AA\ > \2A — 2A\), 
then A' would be a generalized MSTD set as well. 

Proof. We first consider the number of added sums. Just as in HMOSI . in the interval [4, n + 3], AA 
and AA' are identical as all elements come from L + L + L + L. Also, we can pair the elements 

2 While it is expected that A + {0, a}, a > 4(34) = 136 would still have |4A| > \2A - 2A\ (since the two repetitions 
of A would never interact) it is surprising that A + {0, a} still has \4A\ > \2A — 2A\ for many smaller values of a. 
Investigating this might lead to some insight into the structure of sets with \4A\ > \2A — 2A\. 
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of A A in the interval [7n + 1, 8n] with the elements of AA' in the interval [7n + 1 + 4m, 8n + 4m]. 
Since both A and A' are P£ sets, we know they each contain all possible elements more than n 
from their boundaries. Having accounted for the sums within n of the boundaries, \AA'\ — \AA\ = 
(7n + 4m + 1) - (7n + 1) = 4m. 

Now consider the differences in the same way. Again, the elements within n of the boundaries 
of 2A — 2A and 2 A' — 2 A' can be paired and both contain all elements that are not within n of the 
boundaries (since they are P^ sets). The filled middle interval in 2A — 2A is [— 3n + 2, 3n — 2] and 
m2A'-2A'is [-3n - 2m + 2, 3n + 2m - 2]. Thus \2A f -2A'\ - \2A-2A\ = 4m as desired. □ 

Lemma 2.3. Let A = L U R be a P^ set where L C [1, n] and R C [n + 1, 2n] and {1, 2n} G A. 
Form A' = iUOiUMU0 2 Ufi with O x = [n + l,n + k], M C [n + k + 1, n + k + m], 
O2 = [n + k + m + l,n + 2k + m] and R' = R + 2k + m. Ifk>n and M has no run of 3k — 2 
missing elements then A' is a P% set. 

Proof. We need to show that AA' Z> [n + 4, 7n + 8k + 4m] and 2A' - 2 A' D [-3n - 4A; - 2m + 
2, 3n + 4A; + 2m - 2] (because 44' C [4, 8n + 8k + Am] and 24' - 2 A' C [-4n - 4 fc - 2m + 
2,4n + 4A; + 2m-2]). 

First consider AA'. Since 1 6 L, 

L + L + L + Oi D [4 + n,3 + n + A;] 
L + L + Oi + Oi D [4 + 2n, 2 + 2n + 2k] 
L + 1 + 1 + O l D [4 + 3n,l + 3n + 3fc]. (2.3) 

Further, since 2n G i?, 

2 + 2 + 2 + # ^ [5n + 5k + Am + 3, 5n + 8k + Am] 
O2 + O2 + R' + R! D [6n + 6A; + Am + 2, 6n + 8A; + Am] 
2 + R' + R' + R' D [7ra + 7fc + 4m + l,7ra + 8Jfe + 4m]. (2.4) 

We now consider the sums of the O/s. We have 

Oi + Oi + Oi + Oi D [4 + 4n, 4n + 4/c] 

O x + 0! + Oi + 2 ^ [4 + An + fc + m, An + 5k + m] 

Oi + Oi + 2 + 2 3 [4 + 4n + 2k + 2m, 4n + 6k + 2m] 

01 + 2 + 02 + 02 3 [4 + 4n + 3k + 3m, An + 7k + 3m] 

2 + 2 + 2 + 02 ^ [4 + 4n + Ak + Am, An + 8k + Am]. (2.5) 

Finally, we study the sums involving M. We find 

0i + 0i + 0i + Af = (Oi + 0i + 0i) + Af = [3n + 3,3n + 3k] + M D [An+3k + l,An+k + m+3]. 

(2.6) 

This is because the smallest element in M must be at most n + 3k — 2 and the largest element in 
M is at least m + n — 2k + 3 (setting the bounds) and, because M has no runs of 3k — 2 missing 
elements and 30i has 3 k — 2 consecutive elements (closing the gaps). Similarly, 

Oi + Oi + 2 + M D [An + Ak + m + 1, An + 2k + 2m + 3] 

01 + 02 + 02 + M D [An + 5k + 2m + 1, An + 3k + 3m + 3] 

02 + O2 + O2 + M D [4n + 6fc + 3m + l,4ra + 4Jfe + 4m + 3]. (2.7) 
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Assembling these sums in the following order, and noting that the sums are contiguous, we get 
our desired result. 





L + L + L 


+ 0i 


D 


[4 + n, 3 + n + fc] 




L + L-\ 


^Ox 


+ Oi 


D 


[4 + 2n, 2 + 2n + 2fc] 




L 


+ OH 


-Ox 


+ Oi 


D 


[4 + 3n, 1 + 3n + 3k] 






+ OH 


-Ox 


+ 0i 


D 


[4 + 4n, 4n + 4k] 




Oi 


+ 0i- 


hOi 


+ M 


D 


[4n + 3k + l,4n + k + m + 3] 


Ox 


+ 0H 


-0i 


+ o 2 


D 


[4 + 4n + k + m, 4n + 5 A; + 


m] 


Ox 


+ 0i- 


h0 2 


+ M 


D 


[4n + 4& + m + 1, 4n + 2A; -f 


- 2m + 3] 


Ox 


+ OH 


-02 


+ 02 


D 


[4 + 4n + 2k + 2m, 4n + 6A; 


+ 2m] 


Ox 


+ o 2 - 


h0 2 


+ M 


D 


[4n + 5A; + 2m + 1, 4n + 3A; 


+ 3m + 3] 


Ox 


+ o 2 -\ 


-o 2 


+ o 2 


D 


[4 + 4n + 3A; + 3m, 4n + 7A; 


+ 3m] 


o 2 


+ o 2 - 


h0 2 


+ M 


D 


[4n + 6fc + 3m + 1, 4n + 4k 


+ 4m + 3] 


o 2 


+ o^ 


-02 


+ o 2 


D 


[4 + 4n + 4k + 4m, 4n + 8A; 


+ 4m] 


o 2 


+ o 2 - 




+ R' 


D 


[5ri + 5k + 4m + 3, 5n + 8A; 


+ 4m] 


Oo 


+ o 2 


+# 


+ R' 


D 


[6n + 6k + 4m + 2, Qn + 8A; 


+ 4m] 


o, 


i + R' 


+ R' + # 


D 


[7n + 7k + 4m + 1, 7n + 8A; 


+ 4m]. 



(2.8) 

Therefore 4A' D [4 + n, 7n + 8k + 4m] . 

Now consider 2A' — 2A' . Assembling the following sums (using the same logic concerning M): 



L + L 


-R' 


-02 


D 


[2- 


3n - 


- 4k — 2m, 1 — 3n 


— 3k — m] 


L + L 


-o 2 


-o 2 


D 


[2- 


2n - 


-4k- 2m, -2n - 


2k - 2m] 


L + Ox 




-o 2 


D 


[2- 


n — 


4k — 2m, — 1 — m 


-k- 2m] 


Ox + Ox 


-o 2 


-02 


D 


[2- 


4k- 


- 2m, -2 - 2m] 




M + Ox 


-o 2 


-o 2 


D 


[-1 


— k 


— 2m, 1 — 3k — m] 




Ox + 2 




-o 2 


D 


[2- 


3k - 


- m, k — m — 2] 




M + 2 


-o 2 


-02 


D 


[-1 


— m 


, 1 - 2k] 




o 2 + o 2 


-o 2 


-o 2 


D 


[2- 


2k,- 


-2 + 2k]. 





(2.9) 

Since these regions are all contiguous, 2 A' — 2 A' D [2 — 3n — 4k — 2m, 0]. Since 2 A' — 2 A' must 
be symmetric about 0, 2A' — 2A' D [2 — 3n — 4k — 2m, — 2 + 3n + 4k + 2m] as desired. Therefore 
A' is a P*-set □ 

Using these lemmas, we can now prove Theorem 12.11 

Proof of Theorem \2J] Just as in the proof in HMOSL we need to count the number of sets M of the 
form OiUMU0 2 of width r = 2 A; +m which may be inserted into a P+ set A with \4A\ > \2A-2A\. 
We are counting the exact same sets as in HMOS1 . except for them there M could not contain any 
run of k consecutive elements whereas ours cannot contain any run of 3A: — 2 missing elements. 
They could ensure their condition was satisfied by requiring each block oik/2 must contain at least 
one element; the analogous condition for us is that each block of size y — 1 must contain at least 
one element. We can ignore the minus 1, since it will not matter as r gets large. 
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Following the same logic as in HMOSL we end up needing the asymptotic behavior of the sum 




(2.10) 



Note the factors of l/2 2k arise from taking sets Oj that are k consecutive elements, and the factor 
(1 — ^72) is due to our condition of M having at least one element in blocks of size 3k/2. 
Fortunately, in anticipation of this work, HMOSI analyzed the more general sum 

I 1 " 25*) ' (2A1) 

k=n V ' 

showing there is a constant C > such that it is at least C/r°/ 6 (see their Lemma 3.1). Our sum is 
of the same form with parameters a = 2, b = c = 3/2, and thus our sum is at least 1 / r 4//3 . □ 

Remark 2.4. Our density bound above is related to the bounds from HMOSH for sum-dominant sets, 
and an improvement there translates to an improvement here. We describe a simple improvement 
one can make to the arguments in HMOSH . which allows us to replace the 1/r 4 they obtained for 
sum-dominant sets with a 1/r 2 . While a similar analysis would improve our results here, we choose 
not to do so as the real improvement comes from a better choice of the O's ( described in the next 
section) and not the middle. 

We appeal to an analysis of the probability m consecutive tosses of a fair coin has its longest 
streak of consecutive heads of length £ ( see IIScll ). While the expected value oft grows like log 2 (m/2), 
the variance converges to a quantity independent ofm, implying an incredibly tight concentration. 
If we take 0\ and 2 as before and of length k, we may take a positive percentage of all M's of 
length m to insert in the middle, so long as k = log 2 (m/2) — cfor some c. The size of A is negli- 
gible; the set has length essentially r = m + 2k. Of the 2 m+2k possible middles to insert, there are 
C2 m possibilities ( we have a positive percentage of M work, but the two O 's are completely forced 
upon us). This gives a percentage on the order of2 m /2 m+2k ; as k = log 2 (m/2) — c, this gives on 
the order of 1/r 2 as a lower bound for the percentage of sum-dominated sets, much better than the 
previous 1/r 4 . 

3. Constructing many A with \2A + 2A\ > \2A - 2A\, II 

We discuss improvements to the exponent in Theorem 12.11 The following two observations are 
very important in improving our exponent. 

• The O's always show up at least in pairs in the sums and differences used to prove A' was a 
P 4 -set, except in cases where they show up with L + L + L, R' + R' + R! or L + L — R' . 

• Each of L + L + L, R' + R! + R' and L + L — R' contain a run of 16 elements in a row. 

These two points allow us to relax our structure for each of the O's and still have all of the sums 
and differences just stated fill the necessary ranges. This greatly improves our exponent, as we lost 
a power due to the l/2 2fc factor from the O's. As long as each O contains its first and last possible 
element, each O has no run of 16 missing elements and 20 = O + O is full for both O's, A' will 
be a -P,t" set - This looser structure allows us to replace the l/2 2k with a much better factor and thus 
greatly improve our density bound. 

Theorem 3.1. There is a constant C > such that as r goes to infinity, the percentage of subsets 
of[l,r] with \4A\ > \2A - 2A\ is at least C/n r , where r = \ log 2 (256/255) w .001. 
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Proof. Instead of requiring that 0\ and 2 contain all elements in their ranges as before, we now 
only require that they contain the first 16 elements, the last element, and no runs of 16 consecutive 
non-chosen in between. While the old O's contributed l/2 2k to our sum, the new O's contribute 
significantly more. For use later in proving Theorem 11.11 we analyze this problem more generally, 
and force each O to contain the first / elements, the last element, and no run of / missed elements 
in between. We again use a crude bound to ensure that each O contains no runs of / blanks and 
force each O to contain at least one element in every block of f/2 elements. 

In each O we thus have at most 2k/ f blocks of length f/2. In each block, there are 

2 //2 

options 

and all but one contain at least one element. The fraction of subsets that work as O's is thus at least 

of/ 2 - \ \ 2k/f 

-5775-; • (31) 

To use in our sum, we want to represent this expression as 2~ ak . We find 



/ 2// 2 -l \ 
V 2// 2 ) 



2k/ f 



-ak 



2k , 2// 2 - 1 

J lQ g2 2//2 = -ak 

-2 2f' 2 - 1 
a = — log 2 2//2 . (3.2) 

Since, for our current purposes, / = 16, we find a — ^ log 2 |||. We know that our sum guarantees 
a bound of l/W 6 , we know from before that b = 3/2 and now know that a = 2a ~ 0.00142 
(because there are 2 O's). Thus there exists some constant C for which the percentage of subsets of 
[1, n] is greater than C/n r , where r = ~ log 2 (256/255) « .001 □ 

This construction could be pushed further by finding a 'better' A. If L + L + L, R + R + R and 
L + L — R contained longer runs, we would have more freedom in each O, and thus could form a 
better bound. Rather than look for more sets, however, to allow ourselves to push the bound even 
further, we modify our construction slightly, and add two more components to our A'. 

Starting with A as in Theorem 12. II (a P™ subset of [1, 2n] that contains 1 and 2n) we form 

A' = L\JF 1 U0 1 UMU0 2 UF 2 \JR', (3.3) 

where 

• L C [1, n] containing 1, 

• F 1 = [71+1,71+/], 

• 0\ C [n + f + l,n + f + k] containing the first / elements, the last element, and no runs 
of / missing elements, 

• Mc [n + f + k+l,n + f + k + m] with no runs of k missing elements, 

• 2 C [n + f + k + m+l,n + f + 2k + m] containing the first / elements, the last element, 
and no runs of / blanks, 

• F 2 = [n + / + 2k + m + 1, n + 2f + 2k + m], 

• R' C [n + 2f + 2k + m + 1, 2n + 2/ + 2k + m] containing 2n + 2f + 2k + m. 

By a method similar to that used in Lemma [2731 we can prove that these A's are Pf-sets. Since 
A has \4A\ > \2A - 2A\, we have \4A'\ > \2A' - 2A'\ (by LemmaP. 
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With this new construction, we now prove our best lower bound for the density of sets with 
\4A\ > \2A-2A\. 

Proof of Theorem [7771 For any fixed /, we can form 

A' = LUF 1 U0 1 UMU0 2 UF 2 UR / , (3.4) 

with A as in the proof of Theorem l3.ll To ensure there are no overly long runs of missing elements, 
we force each O to contain one element in every block of J/2 and M to contain one element in 
every block of k/2^ 

By allowing k to grow, and summing over our possible sets as we did in Theorems 12.11 and 13.11 
we know that the proportion of subsets [1, n] that have \AA\ > \2A — 2A\ is at least 

^ 1 / i _2 2 s I 2 - 1 

2^tfzy--2kj2j > a = Y log2 2// 2 ' (3 ' 5) 

From Lemma 3.1 of HMOSL this sum is at least C/r p , with 

2a -2 , 2f' 2 - 1 
p = 1/2 = 4 T log2 ^77^- (3 - 6) 

Since 

-2 2 f / 2 - 1 

we can force the bound to be better than C '/rf for any e > 0, completing the proof. □ 

4. Constructing Generalized MSTD sets with given differences 

We explore some consequences of our constructions. We first prove that given any x there is an 
Awith|4A| - \2A-2A\ = x. 

Proof of Theorem [O] We first consider negative x. Let 

S x = [1, \x\ + 2] U{2|x| + 3}. (4.1) 

Then 

4S X = [4,7|x| + 11] U {8|x| + 12}, 25' x -25 x = [-4|x|-4,4|x| + 4]. (4.2) 
Thus 14^1 - \2S X - 2S X \ = (7\x\ + 9) - (8|x| + 9) = = x as desired. 

For x = 0, let S = {0}. Then |4,Sb| - |25 - 2S \ = 1-1 = 0. 

We are left with positive values of x. Similar to Martin and O'Bryant's HMO I proof that \A + 
A\ — \A — A\ can equal any value, we deal with certain small values of x explicitly, then offer a 
method of construction for larger values of x. Let 

Si = {0,l,3,4,7,26,29,30,32,33,34};|S , i + S , i + 5 , i + S , i|-|S , i + iS , i-S , 1 -iS , i| = 1. (4.3) 

Now consider the positive values of x = 1 mod 4, so x — Ak + 1. Define 

S ik+ i = Si + {0,137, 274,..., 137A;}. (4.4) 



3 We could weaken this construction by appealing to results on the length of consecutive heads in tosses of a fair 



coin; see Remark |Z4| As this will not change the form of our final bound, we prefer to keep the exposition simple. 
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Then 

4S 4k+1 = {0 < s < 137(4£; + 1) - 1 : s £ 23 mod 137} 
2S 4k+1 - 2S 4k+1 = {-137(2A; + 1/2) < s < 137(2A; + 1/2) : s ^ 43,231}; (4.5) 

Thus |4S 4 fc+i| - |2^ 4 fc+i - 2S ik+1 \ = ((4k + 1) • 136) - ((4k + 1) • 135) = 4k + 1 as desired. 

Next we consider the positive values of x = mod 4. With S^k+i as in (14.41) . define 

S ik = S' 4fc+1 \{137}. (4.6) 
After some algebra we find that 2S± k — 2S 4k = 2S 4k+1 — 2S , 4fc+1 but 4S ik = 4S , 4fc+1 \{137}. Thus 
1 454*1 — \2S 4k — 2S 4k \ = |4jS4fc+i| — ^^fc+i — 2S ik+ i\ — 1 = 4k, (4.7) 

as desired. 

Next, we study the positive values of x = 2 mod 4. Again, with S^ k+ i as in (14.41) . define 

S 4fc - 6 = *W\{34}. (4.8) 

We have 

45 4fe _ 6 = 45 4fe+ i\{109, 133, 134, 135, 136, 246, 271, 272, 273, 383, 409, 410, 547} 
2S 4fc _ 6 - 2^-6 = 2S Ak+1 - 2S 4fc+ i\{-481, -480, -343, 343, 480, 481}. (4.9) 

Thus 

14^-61-12^-6-^-61 = 14^+11-12^+1-^+11 + 13-6 = 4^ + 1-7 = 4&-6, (4.10) 
as desired. 

Finally, the take care of the positive values of x = 3 mod 4. Here we set 

S 4k -i = ^4fc+i\{33}, (4.11) 

where as always S^+i is as in ( I4.4I ). We have 

454*-! = 4,S 4fc+1 \{133,135} 
2<S , 4fc-i — 25* 4 fc_i = 25*4A;+i — 2o 4 fc+i. (4.12) 

Thus 

|4<S4fe_i| — |254fe_i — 54jfc_i| = |4<S4fe+i| — |254fe+i — iS4fc+i| — 2 = 4k — 1, (4.13) 
completing the proof. □ 



Theorem 4.1. The minimum span for any set with \4A\ > \2A — 2A\ is 30. 

Proof. There are no subsets of [1, 30] with \4A\ > \2A — 2A\, which can be checked by brute force 
in a reasonable amount of time as 2 29 < 10 9 . Thus the minimum span cannot be less than 29. As 
A = {1, 2, 3, 5, 9, 24, 28, 30, 31} has \4A\ > \2A - 2A\, the minimum span must be 30. □ 
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